NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

A Numerically Verified Parameter-Free Third Order Accurate Reconstruction Method for Cell-Centered Finite Volume Solvers on Arbitrarily Unstructured Grids

https://doi.org/10.2514/6.2025-0074

Tu, Shuangzhang; Jiang, Chao (January 2025, American Institute of Aeronautics and Astronautics)

This paper presents the implementation of a parameter-free third-order recon- struction method for cell-centered finite volume solvers on unstructured grids. The reconstruction is based on nodal gradients obtained using the least squares approach from solutions at adjacent cell centers. The cell and face gradients are computed by simple arithmetic averaging of vertex gradients, while the face values are obtained through quadratic interpolation. Importantly, the current reconstruction method does not require explicit second derivatives, and its stencil remains as compact as that used in traditional linear reconstruction methods. The third-order accuracy of the left and right states at the face values, along with the second-order accuracy of the face gradients, is numerically verified on various unstructured grids. This verified third-order accuracy is a crucial condition for ensuring the overall accuracy of the finite volume solver.
more » « less
Free, publicly-accessible full text available January 3, 2026
HgPCN: A Heterogeneous Architecture for E2E Embedded Point Cloud Inference

https://doi.org/10.1109/MICRO61859.2024.00116

Gao, Yiming; Jiang, Chao; Piard, Wesley; Chen, Xiangru; Patel, Bhavesh; Lam, Herman (November 2024, IEEE)

Point cloud is an important type of geometric data structure for many embedded applications such as autonomous driving and augmented reality. Current Point Cloud Networks (PCNs) have proven to achieve great success in using inference to perform point cloud analysis, including object part segmentation, shape classification, and so on. However, point cloud applications on the computing edge require more than just the inference step. They require an end-to-end (E2E) processing of the point cloud workloads: pre-processing of raw data, input preparation, and inference to perform point cloud analysis. Current PCN approaches to support end-to-end processing of point cloud workload cannot meet the real-time latency requirement on the edge, i.e., the ability of the AI service to keep up with the speed of raw data generation by 3D sensors. Latency for end-to-end processing of the point cloud workloads stems from two reasons: memory-intensive down-sampling in the pre-processing phase and the data structuring step for input preparation in the inference phase. In this paper, we present HgPCN, an end-to-end heterogeneous architecture for real-time embedded point cloud applications. In HgPCN, we introduce two novel methodologies based on spatial indexing to address the two identified bottlenecks. In the Pre-processing Engine of HgPCN, an Octree-Indexed-Sampling method is used to optimize the memory-intensive down-sampling bottleneck of the pre-processing phase. In the Inference Engine, HgPCN extends a commercial DLA with a customized Data Structuring Unit which is based on a Voxel-Expanded Gathering method to fundamentally reduce the workload of the data structuring step in the inference phase. The initial prototype of HgPCN has been implemented on an Intel PAC (Xeon+FPGA) platform. Four commonly available point cloud datasets were used for comparison, running on three baseline devices: Intel Xeon W-2255, Nvidia Xavier NX Jetson GPU, and Nvidia 4060ti GPU. These point cloud datasets were also run on two existing PCN accelerators for comparison: PointACC and Mesorasi. Our results show that for the inference phase, depending on the dataset size, HgPCN achieves speedup from 1.3× to 10.2× vs. PointACC, 2.2× to 16.5× vs. Mesorasi, and 6.4× to 21× vs. Jetson NX GPU. Along with optimization of the memory-intensive down-sampling bottleneck in pre-processing phase, the overall latency shows that HgPCN can reach the real-time requirement by providing end-to-end service with keeping up with the raw data generation rate.
more » « less
Free, publicly-accessible full text available November 2, 2025
Implementation of the High-Performance Computing Summer Institute at Jackson State University

https://doi.org/10.46328/ijonest.200

Tu, Shuang Z; Jiang, Chao (June 2024, International Journal on Engineering, Science and Technology)

Between May 25, 2023 and June 21, 2023, we hosted the inaugural four-week High-Performance Computing Summer Institute at Jackson State University. This endeavor was made possible through the support of a three-year NSF CISE-MSI grant. The primary objective of this Summer Institute revolved around the engagement, education, and empowerment of minority and underrepresented students in the realm of High-Performance Computing (HPC) within the field of engineering. Nine undergraduate students with diverse background were recruited to participate in this program. Throughout the program, we immersed these students in a comprehensive curriculum that covered various critical facets of HPC. This curriculum encompassed hands-on instruction in Linux operating system command-line operations, C programming within the Linux environment, fundamental HPC concepts, parallel computing utilizing the Message Passing Interface (MPI) library, and GPU computing through OpenCL. Additionally, we delved into foundational aspects of fluid mechanics, geometric modeling, mesh generation, flow simulation via our in-house flow solvers, and the visualization of solutions. At the end of the program, every participant was tasked with delivering an oral presentation and submitting a written report encapsulating their acquired knowledge and experiences during the program. We are excited to share a detailed overview of our program's implementation with our audience. This includes insights into our utilization of ChatGPT to enhance C programming learning and our suggestion of the NSF ACCESS resources to gain access to HPC systems. We are proud to announce that the program has achieved remarkable success, as evidenced by the positive feedback we received from the participants.
more » « less
Full Text Available
Distributed Sampling-Based Model Predictive Control via Belief Propagation for Multi-Robot Formation Navigation

https://doi.org/10.1109/LRA.2024.3368794

Jiang, Chao (April 2024, IEEE Robotics and Automation Letters)

Full Text Available
Incorporating Control Barrier Functions in Distributed Model Predictive Control for Multirobot Coordinated Control

https://doi.org/10.1109/TCNS.2023.3290430

Jiang, Chao; Guo, Yi (March 2024, IEEE Transactions on Control of Network Systems)

Full Text Available
A Robust Mesh Moving Method for Moving-Boundary Problems

https://doi.org/10.2514/6.2024-1163

Tu, Shuangzhang; Jiang, Chao (January 2024, American Institute of Aeronautics and Astronautics)

This paper presents a robust mesh moving solver developed to address moving boundary problems. Crucially, the resulting deformed mesh retains the same topology as the original mesh without being overly distorted. The mesh is treated as an elastic material, and the deformation of the computational domain resulting from moving boundaries is determined by solving the equilibrium linear elasticity equations. The linear elasticity equations are discretized by the classic Galerkin finite element method and solved by the block conjugate gradient iterative method. To maintain the quality of the mesh after motion, the Young's modulus of each element is weighted by the reciprocal of the distance between the element center and the moving boundaries. The effectiveness of this approach is demonstrated through a set of 2D and 3D test cases featuring prescribed translational and/or rotational motion of the embedded object. The method is now ready for integration into our existing in-house CFD solvers.
more » « less
Full Text Available
MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain

https://doi.org/10.18653/v1/2024.emnlp-main.958

Jiang, Chao; Xu, Wei (January 2024, Association for Computational Linguistics)

Full Text Available
End-to-end decentralized formation control using a graph neural network-based learning method

https://doi.org/10.3389/frobt.2023.1285412

Jiang, Chao; Huang, Xinchi; Guo, Yi (November 2023, Frontiers in Robotics and AI)

Multi-robot cooperative control has been extensively studied using model-based distributed control methods. However, such control methods rely on sensing and perception modules in a sequential pipeline design, and the separation of perception and controls may cause processing latencies and compounding errors that affect control performance. End-to-end learning overcomes this limitation by implementing direct learning from onboard sensing data, with control commands output to the robots. Challenges exist in end-to-end learning for multi-robot cooperative control, and previous results are not scalable. We propose in this article a novel decentralized cooperative control method for multi-robot formations using deep neural networks, in which inter-robot communication is modeled by a graph neural network (GNN). Our method takes LiDAR sensor data as input, and the control policy is learned from demonstrations that are provided by an expert controller for decentralized formation control. Although it is trained with a fixed number of robots, the learned control policy is scalable. Evaluation in a robot simulator demonstrates the triangular formation behavior of multi-robot teams of different sizes under the learned control policy.
more » « less
Full Text Available
Dislocation flow turbulence simultaneously enhances strength and ductility

https://doi.org/10.1073/pnas.2316912121

Chen, Yang; Feng, Hui; Li, Jia; Liu, Bin; Jiang, Chao; Liu, Yong; Fang, Qihong; Liaw, Peter K (March 2024, Proceedings of the National Academy of Sciences)

Multi-principal element alloys (MPEAs) exhibit outstanding strength attributed to the complex dislocation dynamics as compared to conventional alloys. Here, we develop an atomic-lattice-distortion-dependent discrete dislocation dynamics framework consisted of random field theory and phenomenological dislocation model to investigate the fundamental deformation mechanism underlying massive dislocation motions in body-centered cubic MPEA. Amazingly, the turbulence of dislocation speed is identified in light of strong heterogeneous lattice strain field caused by short-range ordering. Importantly, the vortex from dislocation flow turbulence not only acts as an effective source to initiate dislocation multiplication but also induces the strong local pinning trap to block dislocation movement, thus breaking the strength-ductility trade-off.
more » « less
Full Text Available
Automated Patient-Robot Task Assignment in a Simulated Stochastic Rehabilitation Gym

https://doi.org/10.1109/ICORR58425.2023.10304716

Miller, Benjamin A.; Adhikari, Bikranta; Jiang, Chao; Novak, Vesna D. (September 2023, IEEE)

« Prev Next »

Search for: All records